Towards more natural synthetic speech
نویسنده
چکیده
This article reports the results of two experiments in which factors such as duration, amplitude and noise are manipulated, in order to achieve more natural utterances in synthetic speech. The participants were native speakers of English, instructed to judge the naturalness of the different versions of utterances generated throughout the manipulations. The results indicate that there are signif icant individual preferences, as well as classification principles other than conventional ones. There is evidence to believe that further research in this area will render positive results in the search for naturalness. The same principles could be applied to search for naturalness in the prosodic structure of the synthetic utterances. Advancement in this area will surely render improvements in Spoken Dialogue Systems.
منابع مشابه
Evaluation of synthetic and natural Mandarin visual speech: Initial consonants, single vowels, and syllables
Although the auditory aspects of Mandarin speech are relatively more heavily-researched and well-known in the field, this study addresses its visual aspects by examining the perception of both Mandarin natural and synthetic visual speech. In perceptual experiments, the synthetic visual speech of a computer-animated Mandarin talking head was evaluated and subsequently improved. Also, the basic (...
متن کاملThe Temporal Delay Hypothesis: Natural, Vocoded and Synthetic Speech
Including disfluencies in synthetic speech is being explored as a way of making synthetic speech sound more natural and conversational. How to measure whether the resulting speech is actually more natural, however, is not straightforward. Conventional approaches to synthetic speech evaluation fall short as a listener is either primed to prefer stimuli with filled pauses or, when they aren’t pri...
متن کاملA Wavelet-Based Technique Towards a More Natural Sounding Synthesized Speech
This paper presents a wavelet-based technique to increase the quality and naturalness of LPC based synthesized speech signals. The proposed method is based on wavelet decomposition. We first obtain the wavelet coefficients, and then the variances of the wavelet coefficient at the last four scales (correspond the higher frequency region) of the synthetic speech are replaced by the original varia...
متن کاملSampling-Based Speech Parameter Generation Using Moment-Matching Networks
This paper presents sampling-based speech parameter generation using moment-matching networks for Deep Neural Network (DNN)-based speech synthesis. Although people never produce exactly the same speech even if we try to express the same linguistic and para-linguistic information, typical statistical speech synthesis produces completely the same speech, i.e., there is no inter-utterance variatio...
متن کاملPitch accent type matters for online processing of information status: Evidence from natural and synthetic speech∗ AOJU CHEN, ELS DEN OS AND JAN
Adopting an eyetracking paradigm, we investigated the role of H*L, L*HL, L*H, H*LH, and deaccentuation at the intonational phrase-final position in online processing of information status in British English in natural speech. The role of H*L, L*H and deaccentuation was also examined in diphonesynthetic speech. It was found that H*L and L*HL create a strong bias towards newness, whereas L*H, lik...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Procesamiento del Lenguaje Natural
دوره 29 شماره
صفحات -
تاریخ انتشار 2002